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SPIR IT is a an Intelligent Tuturmg System for tutoring probability theory which has evolved through a 
continuous process of experimentation and tuning. The system manages a unique flexible tutoring style. 
On one hand the system may behave as a tutor who mostly observes the student without interference, 
intervening only when things are really going wrong, and on the other hand, it may behave as a tutor who 
manages a "questioning and answering" type of dialogue. Hasedon a belief constructed about the student's 
aptitude, the system frequently changes its tutoring style. SPIRIT integrates several artificial intelligence 
methods that include: a ihcorcm prover, a production systenu an object oriented system and procedural 
knowledge embedded in USP code. 

1. Introduction 

Although much progress has been achieved in the lost decade as people have started 

to apply A I methods to the design of tutoring systems, tutoring systems today are far 

less competent than the experienced tutor. One of the reasons for this Is the lack of a 

well f ounded theory of teaching. As Sleeman and Brown ([1], p.3) put it: 

llic tutoring strategics used by these systems arc excessively ad-htx: reflecting unprincipled 
intuitions about liow to control tlieir beliavior. Discovering consistent principles will be facilitated 
by constructing better theories of learning and mislcaming... 

Moreover, because of the complexity of the design of Intelligent Tutoring Systems, AI 
workers have often focused on one or two aspects in I nLelligcnt Tutoring System 
(ITS), rather than developing a complete system that does a satisfactory tutoring job 
[2], i='or example, Brown and Burton [3] in DEBUGG Y and Sleeman and Smith^Hf in " 
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' LMS focused on student modeling. Clancey [2] focused both on the tutoring 
knowledge and on student modeling but his system used MYCIN'S [5] knowledge as 

its domain knowledge, which was not specially built to serve tutoring purposes. 

\ _ . . 

The underlying approach of this research is that since we do not have a good theory 
of learning, an effective ITS should evolve through a process of experimentation and 
tuning. What the system should do can best be identified by experimenting and using 
the system. Therefore, the designer's focus should not be on the effectiveness of the 
various componets such as student modeling, tutoring strategies or domain expertise as 
stand alone parts but on the effectiveness of the system as a whole. 

This approach was used in the design of SPIRIT; an ITS for tutoring probability 
theory that has evolved to be a successful system. During tlie process of developing the 
system the designer's focus ollen shifted from one aspect to another as deemed 
appropriate based on experiments that were continuously conducted witli the system. 
The system integrates various Al methods such as a theorem prover, a production 
system, an object oriented system and conventional i.isp programming; each method 
was selected to do the task for which it is best suited. All together the methods do a 
complete task. 



2. The task: tutoring probability theory 

SPI RIT tutors: —'^^^ 

• Elementary concepts of probability theory and formulation of 
probabjlity problems. 

* Basic probability rules: miilliplication and addition rules, conditional probability, 
rules of intersections and unions. 

- Special cases of probability rules: independent events, mutual exclusive events, 
marginal probability. 

- Bayes' rule. 

e 



A student interacting with the system is assumed to be familiar with the basic concepts 
of probability theory. Word problems in probability are posed to the student, and the 
system follows and guides the student while he is solving the problems, focusing on 
the recognition of the concepts presented by the problem and on the application of 
probability rules in problem solving. 

The first stage of the research with SPIRIT was to investigate the behavior of the 
experienced human tutor in tutoring the relevant subset of probability theory. In 
preliminary experiments five tutoring sessions were recorded and analyzed. Most of 
the tutoring principles that were identified at that stage were implemented in a 
prototype of SPIRIT. However, through the process of experimentation with SPIRIT 
the system has been changed very significantly. Moreover, tutoring principles used by 
the human tutor in the preliminary experiments which were ignored in previous 
analyses became noticeable as more experiments were conducted with the evolving 
system. Before this evoludon process will be described, SPIRITS architecture will be 
reviewed. . 

3. SPIRITS architecture 

The major components ofSPlRIT are depicted in figure 1. In the following, I will 
briefiy describe the function of each componenL 



INSERT FIGURE 1 HERE 



Tutoring expert 

The tutoring expert manages the dialogue between the system and the student, and 
makes most of the important decisions of what tutoring actions to talce. There arc two 
different styles of luioring which the system uses; tutonxndmefUor. The /w/or manages 



a dialogue in which the student is strictly guided and is told exactly what the next 
move is. It first goes over the natural language problem text, possing to the student the 
important sentences and asking him to represent each sentence using probability 
notationsThis process is called working in the "symbolic context". After all the 
important information is extracted from the problem text , the tutor gixidQS the student 
along the solution process. This process uses an AND/OR tree generated by the 
probability expert which will be described below. If the student wishes, he may jump 
ahead and attack any relevant intermediate problem. However, at each point along the 
dialogue managed by the iuior, there is an enforced "understanding" between the 
system and tiie student about what to do next The other style of tutoring is the 
mentor, which manages quite a different style of dialogue. The mentor usually does 
not tell the sti dent what to do. The student uses the terminal us scratch paper. He can 
type whatever he wants. He may start by expressing symbolically the information 
given in the problem text as well as writing some formulas or algebraic expressions. 
The w<?/i/or analyzes each line the student types. From time to time the mentor may 
decide to intervene. It may correct mistakes, encourage, or transfer control to the tutor. 
Between the /u/or and the me/i/or there is a smooth two-way interaction. The student 
cannot always tell with which component he is interacting. The system behaves 
somewhat as a human tutor who changes his style of tutoring on the basis of the 
circumsumces. A summary of all that happens along the tutoring session is 
maintained and is used by the lutonxnd the mentor. In addition, some strategic 
decisions arc made on the basis of a deeper analysis of the student which is done in 
the Student Model. One important decision may be to transfer control to one of the 
specialists (which will be described below) that can handle in depth a basic student's 
misconception. 

Problem Model 

Probability problems to be presented to the student in the tutoring process are 
rcprcsciucd wiihin SPIRIT by means of a problem model. The problem model is built 
by tiic LCiichcr. The events are described and the information needed with respect to a 
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problem is recorded in this model. For example, the problem model would specify if 
, events are independent, mutually exclusive or exhaustive. 

The probability expert 

The probability expert can solve probability problems within the domain covered by 
the system. However, before the expert can solve a problem all the events referred to 
in the problem must bo identined, and the special cases for which probability rules are 
extrapolated should be explicitly described. For example, the probability expert must 
be told if two events are independent because it can not deduce this information from 
the natural language problem text. Tlie conceptualization used by the expert 
component is that of backward chaining through formulas. That is, first the formula 
that yields the final answer to the problem is constructed. Then, for each argument in 
this formula the probability expert attempts to find a formula that may be used to 
calculate the argument The process terminates when reaching terms for which 
numbers given in the question can be substituted directly. This was the prevailing 
conceptualization used by students in the preliminary experiments. The expert 
produces an AND/OR tree by chaining the probability rules. in the backward 
• direction. Then the tree is 'solved', meaning that the desired answer in the root of the 
tree is 'proved'. This is done by propagating the numbei-s in the leaves of the tree 
upward until the final answer is constructed. This process is based on Uie classic notion 
of Backward Deduction System as described in [6]. The AND/OR tree is oftert helpful 
in allowing the system to follow the student's reasoning. 

Error Analysis 

The purpose ofthis component is to identify students' systematic mistakes and to find 
the misconceptions hidden behind the mistakes. This is the same objective underlying 
the development of DEBUGG Y [3]. The error analyzer is composed of several lisp 
subroutines, each subroutine corresponding to a probability rule. Each subroutine 
attempts to map the formula the student wrote to the correct formula. The 
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discrepancies are identified and classified to several types corresponding to the 
assumed misconcepdons behind the mistalce. An effort was made to account for the 
vast majority of mistakes a student may commit. After the error analyzer identifies the 
mistake the student has made, a message is sent to the tutoring expert Then, the 
tutoring expert can make the decision whetiier or not to intervene immediately to 
correct the mistake, and if so in what way. 

Sludcnt Model 

The student model analyzes the student's performance. It makes assumptions about 
the student's overall capability and accumulates knowledge about the student's 
performance with respect to various subskills the student is expected to acquire. The 
system makes an assumption about the capability of the student because not ail 
students should be handled alike. A more competent i^tudcnt receives more complex 
explanations, and interacts mostly with the mentor. A weak student receives 
simplified explanations and interacts mostly with the tutor. 
The system's beliefs are continuously revised and changed based on the 
circumsuinces. The student model is accessible to the tutoring expert and to the 
specialists which are described below, and it influences decisions made by these 
components. 

Spcchilists 

From time to time the tutoring expert may decide to treat in more depth a student's 
fundamental misconception that was revealed in the problem solving process. This is 
done by transferring control to a specialist. The specialists arc i.isp subroutines that 
interact with the student. A specialist queries the student model so as to teach the 
student in a suitable way. The inforr.iation a specialist uses include: what the student 
has done before, and its current belief about his capabilities. Some of the specialists 
that arc currently implemented handle the following issues: independence, marginal 
probability, conditional probability and the union rule. The specialists may present 
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Venn diagrams, may pose questions and may relaie things done previously to the 
current problem. 

The Interface 

The purpose of the interface is to analyze the student input, to parse it and to dec;de 
what the student intends to do. Sometimes the studenris asked to do a specific thing. 
This usually happens when he interacts with the tutor. In tiiis case, the system knows 
what to anticipate. When the student interacts with the mentor, a line typed by the 
student is handled by a component that makes assumptions about the student's 
intentions. The various things that a student may attempt while interacting with the 
tutoring expert are: expressing data symbolically, writing a formula, substituting 
numbers in a pre-writtcn formula, writing an algebraic expression and writing a 
numerical value. In addition the student may request help either because he is 
confused or in order to solve an intermediate subproblem. He may also ask for 
information about either the probability values found or the formulas written up to 
this point. In addition, the terminal can be used as a calculator at almost any point in 
the dialogue. 

4. The iniplcnieiUulion framework 

Since it was clear at an early stage of the research that SPIRIT would evolve and 
change, it was extremely important to choose an appropriate tool to facilitate the 
building and the refinment of the system. The component most naturally subjecte to 
changes and tuning is the tutoring expert Tutoring knowledge has been often 
represented in explicit rules (e. g. Clancey [2], Oshera [7], Collins [8] and Lantz et al. 
[9]). There are some rule-based tools (e.g. i-mycin [10] ) that could have been used in 
SPIRIT. oi»s5 [11], however, seemed to be particularly appropriate for the design of an 
ITS. oi>s5 has the advantage possessed by most rule-based tools; rules are relatively 
independent of each other, and changes in one rule do not necessarily force changes 
other rules. As a result, the system's behavior can be changed relatively easily. In 
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addition to this advantage opss has two other important advantages for die design of an 
ITS. First, the principles of conflict resolution strategy used in opss, specificity and 
recency suit very well (as will be demonstrated below) the tutoring domain. Second, 
the structure of working memory elements that are accumulated in the working 
memory, and of production rules which are sensidve to certain elements corresponds 
nicely to the structure of a tutoring session. Working memory elements may 
correspond to elementary pieces of interaction and represent the elements of 
knowledge the human tutor acquires in a tutoring dialogue. The production rules may 
correspond to various tutoring decisions that ought to be made in certain scenarios 
presented by the working memory. Thus, ores provides a natural way of representing 
the dynamic knowledge accumulated during the dialogue and the static tutoring 
knowledge that corresponds to the human tutor's experience. Moreover, oi>ss is a 
tremendously efficient tool that can reduce system s response time which is a major 
factor for success in a highly interactive system such as an ITS. 

In SPIRIT, productions correspond to tutoring decLsons which manage the dialogue. 
Each working memory element describes an elementary interaction, and indicates 
what the student did and how the system responded. A move taken by the student is 
described by a working memory element which indicats what is the move, whether or 
not it is a correct move and if not what type of mistake has been made. After the 
system takes a tutoring action a working memory element which describes this action 
is introduced into the working memory. With respect to each tutoring decision, there 
are usually several productions, arguing for different tutoring actions. These 
productions actively compete for the right to fire (i.e. for the right to execute their 
action part). The conflict resolution strategy employed provides an intuitively 
appealing approach for resolving these conflicts. 



5. SPIRITS evolution 
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Because SPIRIT has continuously evolved, and was changed almost every day, there 
are no versions or distinct stages in the system's life cycle. However, this section 
divides the evolution process into three stages, for purposes of exposition. The firet 
stage is called tlte base system and includes only the "tutor" tutoring style. The second 
stage includes both the tutor and the mentor partially integrated. The final system is 
the complete tutor-mentor integrated system including student's modeling and 
specialists. 

5.1. The base system - Tollow the tree 

The base system can be viewed as a combination of two sub-systems, called 
"symbolic" and "solve". The symbolic component sequentially presents to the 
student segments of the probability problem, each of which is a part of the text that 
expresses a probability concept. The student is asked to represent each segment by a 
probability expression. After the student has completed expressing symbolically the 
probability word problem, he moves to the solve stage, where he applies probability 
formulas. 

The approach used by the base system is that of questioning and answering. Every 
system response includes a question and every student response includes an imswer to 
a specific question. After ail the segments have been symbolically represented the 
system starts the process of following the AND/OR tree, which is strictly a backward 
chaining approach. The major goal that drives the system is that of finding the solution 
to the probability problem, (rather than instructional goals). Although this approach 
may appear relatively simple, the base system uses several important tutoring 
strategies that were identified in the preliminary experiments involving some design 
complexities. The interesting issues are related to decisions determining what is the 
proper explanation, when to provide it, and when to probe thv: student The extensive 
analysis of mistakes done by the error analyzer also contributed to the system's 
effectiveness. 
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Some of the important tutoring strategies used by the base system were the following: 

1) Probe the student By "probing" we mean pushing the student to be active and to 
do things even when he hesitates or prefers the system to tell him what to do. The 
system does not provide help automatically whenever help is requested. When the 
student types.a "?" there are usually several productions whose left hand side (LHS) 
are satisfied, each arguing for a different tutoring action. Those with a more specific 
LHS have a stronger argument for providing substantial explanation. The production 
whose LHS is more specific would be satisfied only when there has previously been 
some action with respect to the current intermediate problem, so more substantial help 
should be provided. The production whose LHS is less specific would be fired only in 
the cases where the more specific productions are not satisfied, and not much has been 
done (if anything at all) that pertinent to the current intermediate problem. As a 
result, a student asking for help immediately, without making any effort first, would be 
probed, while a student who has been struggling for some time with the current 
intermediate problem, or who has already asked once or twice for help, would receive 
more extensive help. When the student asks for help for the first time after an 
intermediate problem is posed to him, the system always asks the student to make 
some effort and to type something, even if it isjusta guess. The first time in the 
dialogue that the system probes the student, it explains that even though the student 
is not confident of his answer, it is better to guess so the system will have some 
knowledge about his thinking, in most cases, the base system provided substantial 
help after three repetitive help requests, or after a help request which was preceded by 
one or two attempts. As will be discussed in a subsequent section, this behavior of tlie 
system was often inappropriate. 

2) Make the student find the answer by himself. The system attempts to provide 
minimal help while still making sure tl at the student is progressing. This is dune by 
choosing and providing the appropriate explanation. When the student makes a 
mistake, the system first responds by showing him why his answer is incorrect. After 
detecting repetitive mistakes the system begins to address the correct answer more 
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explicitly in its explanations. Although these become more and more elaborated, only 
in the extreme case is the answer provided. This strategy enables each student to use 
just the help he needs, not more nor less. An example of this strategy is the following. 
Suppose the student is expected to write the formula p(A) = p(A n B) + p(A n -B), and 
in his first attempt he writes the formula p(A)=p(A / B) + p(A / -B). The system's 
response would be the following: "You can not add conditional probabilities like this. 
You may get a larg<^r than 1 probability. Try to think about a more basic fonnula 
which docs not involve conditional probabilities. Try again to write a formula for p 
(A)." . This explanation says very little about the correct formula. If in the next 
attempt the student is wrong again the system's response would be more to the point. 
For a typical mistake, the system is capable of providing explanations in about three to 
four levels. The base system always provides first the shallowest explanation and then 
the more elaborated ones. The vast majority of the explanations provide two kinds of 
information, information about what is wrong in the recent mistake and information 
. about the right answer. The shallower explanations focus on the recent mistake, while 
the more elaborated ones focus on the correct answer. A similar tutoring strategy was 
used in the WEST system [12] where four successive levels of hints where used. The 
first addressed tiie student's weakness and successive levels increasingly addressed the 
optimal move to be taken. 

3) Relate things to what has been done previously. This strategy is used in two cases: 
(1) when a student reveals tliat he has a misconception which had been identified and 
dealt with before, (2) when a student reveals a misconception, although previously he 
did something suggesting that he did not have this misconception. Each time the 
student either makes a mistake or requests help, the system examines to see whether ot 
not the concept raised by the current intermediate problem has been discussed or dealt 
with before. If the system identifies that the issue has been discussed, it refers the 
student to the appropriate intermediate problem that raised this issue. In the base 
system, issues are discussed only when the student makes a mistake or requests help 
related to the issue. This is in contrast to the way ihc principle of relating things wiis 
implemented in GUIDON [2]. GUIDON encourages the student when he properly 
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applies a domain rule that p reviously he did not know, but the case of a student failing 
to apply things that previously he knew, is not handled explicitly. An example of this 
strategy in SPIRIT is the following: Suppose the student has difficulties in expressing 
conditional probability, but he has correcdy expressed conditional probability in 
another segment The system in that case would refer the student to the segment that 
he has correctly expressed, and it would indicate that the two segments (the one done 
before and the current one) have the same structure. An example of this strategy in the 
solve stage would be referring the student to an intermediate problem in which he has 
applied the probability formula that is currently required. The productions that 
implement this strategy have priority over other productions that provide explanations 
or probing statements. Therefore, the system always prefers to refer the student to 
what he has done, rather than to take other tutoring actions. 

4) Encouragement strategies. Several strategies that are not dependent on extensive 
student modeling were implemented in the base syistem. Negative feedback is avoided 
as much as possible. The system does not use statements such as "wrong" or 
"incorrect". Instead, the response to a mistake is always a constructive statement that 
tells the student what is wrong with his answer, or provides him with some hint about 
the correct answer. Positive feedback, on the other hand, is given whenever 
appropriate. The student is given the feeling that he has found the answer by himself 
even when he has received substantial help from the system. 

5) Generalization strategies. The process of following the tree is explicitly described 
during the dialogue, thereby demonstrating and teaching the backward chaining 
approach to problem solving. Explanations are provided in general terms using the 
abstract form of formulas (using the symbols A or B instead of the current events' 
names in the specific problem). 

Problems in the base system 

The base system was tested by several subjects that expressed more or less the same 
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concerns about the system's performance.' Subjects complained that while interacting 
with the system they lost the whole picture and they felt as though they were being 
quizzed. That is, the tutoring session seei|ied to them as one in which the tutor asks a 
series of unrelated questions. The subjects who could have performed better were 
prevented from solving the problem by skipping some of the stages or implicitly doing 
some intermediate moves. They also expressed dissatisfaction because the decision of 
what to do next was always being made by the system. Making this decision, they felt, 
was an interesting challenge, and so the^ easily got bored and did not express any 
desire for a long engagement with the system. 

i 

The subjects \yho did not perform well also felt as though they were quizzed, but did 
not mind the system deciding for them what to do next; their major problem was 
related to the interface. The strategy of "probe the student" very often tended to 
humiliate the weak subjects. Subjects generally engaged in a long process of thinking 
before tliey requested help. Even remarks from the experimenter who reminded them 
that a help facility was available did not change their hesitant behavior significantly. 
When a subject eventually requested help, the system's response often told him to try 
again or to give his best guess. Tliis of course might be vfery discouraging. 

Tlie interaction with the system was done using a screen terminal and students used a 
text editor (a full screen editor) to scroll back and forth in case they wanted to see 
things done pieviously. This was a burden for all the students who used the systerr 

5.2. Second round: Adding the mentor 

The concerns that subjects expressed with respect to the base system's approach 
seemed to be prominently related to the fact that the base system did not address the 
student's need for strategic and planning knowledge in problem solving ( [13], [14]). 
The approach that was adopted in the second round was based on the assumption that 
a student should first know the general concepts of probability theory, and how to 
make probability inferences with this knowledge before he can sharpen his strategic 
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knowledge. While the tutoring of knowledge of concepts and of making inferences 
was believed to be done quite satisfactorily in the base sy^jtem (excluding the interface 
problemX the system needed a new approach for 

handling the tutoring of strategic knowledge. This new approach was implemented by 
the mentor tutoring style. 

The idea was that a student should be allowed to select his moves as he wishes, thereby 
practicing his strategic knowledge and revealing his ability to the system in this respect 
The implementation of this idea in a tutoring system is a novel aspect of SPIRIT, 
undertaken to approach more closely the human tutor's behavior, as reflected in the 
tutoring protocols. The major theme of the second round was therefore to use the 
base system's approach for the students who do not have the knowledge of the basic 
probability concepts and rules, and to use the mentor approach for those who have the 
basic knowledge but need tutoring focused on strategic knowledge. 

In the second round there were two systems; the tutor and the mentor. Tliesc two 
systems were not mutually exclusive; rather, the tutor was a subset of the mentor 
system. The tutor as an independent system was similar to the base system witli some 
upgrading as described below. The mentor used the tutor as a sub-system. A student 
when suming lo interact with the system had the choice between the two systems, and 
had only limited option to switch between them making a preliminary choice. 

Improving the tutor 

Tlie strategy of probing Uie student was modified so that the 

student would not feel humiliated. The productions that argue 

for providing niore elaborated explanations received more priority in confiict 

resolution. The system was changed so that the student would never be asked to try 

again more than once while working on any one intermediate problem. Also, the 

revised system always provided meaningful help if the student's help request came 

after some effort had been made with respect to the current intermediate problem. 
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The strategy of probing the student was still used by the system, but it was changed to 
be applied in a more moderate and graceful way. In order to alleviate the problem of 
getting lost as mentioned earlier, two new corafhands were made available to the 
student. The command data displays all the probability expressions as well as their 
values, that have been defined by the student This command freed the student from 
the necessity to scroll^back and forth in order jo see the numbers that were given in the 
problem text. The command /ormw/a has thejsame purpose, but with respect to 
formulas that have been written up to ihe point where the student issues the 
command. The command formula displays all the correct formulas that the student has 
typed in. The tutor system behaved like the/base system with the changes mentioned 
above. / 

/ 

The mentor j 

I 

In the mentor, the problem text is presented to the student who is asked to solve it 
using the screen the same way he would/use scratch paper. The mentor does not force 
the student to start the problem solving process in the symbolic stage nor to adopt the 
system's backward reasoning approach. Instead, the student makes his moves while 
the mentor analyzes his progress, and whenever appropriate it intervenes by taking 
some tutoring action. These actions can be performed cither by the tutor, which is 
instructed to assume control temporarily or by the mentor. All tutoring actions 
(except encouragements) are evoked as a result of mistakes and help requests. 
Encouragements are provided from time to time as deemed appropriate. Thus, a 
student who does not make mistakes and does not request help could solve the 
problem with the intervention of only some system remarks such as "Good" or 
"Correct". In many cases, a mistake made by the student does not evoke the mentor's 
intervention. Thus, the mentor uses the strategy of letting the student discover his 
own mistake. The rationale behind this strategy is that if the system does not correct a 
student's mistake there is a chance that the mistake would lead the student to a dead 
end or to a point where the mistake becomes obvious. When the student, himself, 
identifies that he has made a mistake there is a higher chance that he would remember 
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not to make this mistake again. The mentor has to make a difficult decision of 
whether or not to let the student go in the wrong track, and if so, how far. There must 
be some point from which letting the student go further in the wrong track is a 
counter productive strategy. Deciding where this point is in each case is a difficult 
question. The rules that determine this point were constructed on the basis of 
experiments and on a trial and error process of tuning. There are two classes of cases nx 
which the mentor may decide to intervene: an intervention may be.triggered by th 
student's request for help, or it may be evoked by some strategy that specifies an 
intervention in a particular scenario reflecting by the various moves that the student 
has taken. 

Interventions triggered bv help requests 

The mentor always intervenes if the student requests help while he is on the wrong 
track. When the student requests help and the move preceding the help request 
included a mistake, an assumption is made that the confusion is the result of the 
mistake, and the mentor transfers control to the tutor for a discussion related to the 
last mistake made. The tutor handles this mistake as described in the previous section. 
After the mistake has been fixed the tutor returns control to the mentor. The principle 
of selecting the most recently mentioned issue for discussion was also used both in 
GUIDON [2] and in WEST [12]. In WEST this strategy was called "focus strategy" and 
was compared to the "breadth strategy" which selects for discussion an issue that has 
not yet been discussed. When the student asks for help at the very beginning of the 
problem solving process, the mentor gives the student an opportunity to switch back to 
the tutor. The mentor also recommends in this case to start the problem solving by 
expressinc the data symbolically. A help request which is made neither after a 
mistake nor in the beginning of the problem solving process is understood by the 
mentor as ifthc student were saying "1 don't know what lo do next". In this case the 
mentor looks to sec what is the most recent thing that the student has done. Using the 
AND/OR tree Lho mentor then recommends what the backward reasoning approach 
would dictate lo do next 
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Sometimes the student may need help in findinig some particular probability 
expression, which he believes to be required for solving the problem. He can receive 
this help by issuing a specific help request, i.e. by typing the probability expression 
followed by a "?". There are three possible cases. First, the student may request help 
for finding a probability expression which is irrelevant and is not required. The 
mentor, in this case, would tell the student that he does not need this expression. 
Second, the student may request help to find a probability expression for which either 
its value or a formula has already been found. In this case the mentor would remind 
the student that the analysis of this expression has already been made, and would also 
remind him of the commands daia und formula. In the third case the specific help 
request is relevant, and the mentor would provide some hint about how to find this " 
expression. If the student requests help by typing a "?". following the system's 
response to a specific help request the mentor would understand this request as saying 
"This hint is not sufficient for me. Please give me more hclp to find the probability 
expression that 1 asked you before.". Notice that this is a different interpretation for 
the command than the one mentioned earlier. Thus, the mentor has the ability to 
give more than one interpretation to a command based on the particular context in 
which the command is issued. 

Interventions which are not triggered bv heln requests 

The mentor provides a mechanism that enables the implementation of strategies of 
the form: " if a student has Uiken moves A, B, C, etc. then, do tlie following...". 
Throughout the process of experimenting with the mentor, such strategies can be 
easily added and changed. An example of such a strategy is the following. I f the 
student has more than one mistake in expressing conditional probability, the mentor 
would intervene and correct the student's related mistakes, thereby demonstrating to 
the student that he had a misconception that resulted in these mistakes. Some mistakes 
may evoke immediate intei-vention. An example of such a mistake is the mistake of 
confusing algebraic and set operators. This mistake triggers an intervention because 
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the underlying misconception may cause the student to use improper notation not 
understood by the system, thereby inhibiting further analysis of the student's moves. 
Other interventions are required in order to maintain flow of the dialogue. For 
example, if the student types i)t an algebraic expression which is an instantiation of an 
incorrect formula that was typed in earlier, the mentor would intervene telling the 
student that the formula he is using is incorrect The mentor always forces the student 
to work explicitly, thereby eliminating the possibility of accidently flnding the correct 
answer; this also helps the student acquire the habit of working in a thorough, 
organized manner. Thus, if the student types in an instantiation of a formula (i.e. the 
actual numbers are plugged in), and the formula used has never been written before, 
tlie mentor would ask the student to write the formula that he is using. This is done 
whether the implied formula is corrector not The mentor may decide to intervene in 
order to encourage the student For example, if the student has written an incorrect 
formula, and later he rewrites the formula correctly, the mentor would encourage him. 
The mentor would also intervene to eliminate cycling. That is, if the student takes 
some previous move for the second time, the mentor would mention it to him. 

The problems in the second round 

The system in the second round was able to handle two tutoring styles. However, when 
a student started to interact with the tutor he was forced to continue the interaction 
using this tutoring Style tliroughout the entire problem solving process of at least one 
probability problem. The same is true for a student who started to interact with the 
mentor. The assumption that students can bo classified into two classes, those who 
know the basics and need practicing in the application of strategic knowledge, and 
those that need tutoring in tlic basics, was oversimplified. In the experiments with 
the second round it became apparent that students can not be classified into two 
distinct categories in a siuisfactory way. Rather, the knowledge and skills that students 
have and acquire by using the system, represent a continuity from students who arc 
very knowledgeable and skillTul to those who are ilcw and need much more guidance. 
Moreover, it seemed Unit students needed the "tutor" style of tutoring in some parts of 



tlie session but preferred the "mehtor" tutoring style in other parts. In the 
experiments, some students were confident in the symbolic stage and therefore 
proceeded in the mentor. However, in the solve stage they encountered difficulties. 
Although the mentor often transferred control to the tutor for handling the many 
mistakes that these students made, control was kept most of the time by the mentor. 
Each lime the tutor returned control to the mentor, students received the mentor's 
message "Please go on", which did not tell what to do next, and that discouraged the 
students who did not know how to proceed. For those students it was quite obvious 
that the preliminary decision for using the mentor, while being the right one for the 
symbolic stage, was wrong-for the solve stage. Other observations showed the other 
aspect of the same problem. Some students started the dialogue with shallow 
knowledge. The tutor tutoring style seemed to be appropriate in the beginning of the 
dialogue. However, after some time, the students understood the system's approach of 
backward reasoning, and then wanted to skip some of the simple intermediate moves. 
This of course, was not allowed by the tutor, and therefore the students became 
frustrated. 

A second major problem with the second round system was that some students felt that 
they did not learn enough through the problem solving exercise. The system did not 
provide any in-depth discussion of prubabilily concepts. Wiien the research with 
SPIRIT began, the system was defined as a tutoring system whose purpose was to do 
coaching rather than teaching. That is. an assumption was made that a student 
interacting with the system knows the basic concepts and what he needs is practice. 
Despite the fact that teaching was not SPIRITS intended central task, it seemed from 
the experiments that some teaching capabilities are needed to be incorporated in the 
system. This requirement is not a simple one because, instructional goals are 
sometimes confiicting. In particular, the goal of coaching may contradict the goal of 
teaching basic concepts because elaborate discussions of concepts in the midst of the 
problem solving process may severely disturb the flow of reasoning and inhibit 
autoniatistn [151. 
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The two major problems mentioned above demonstrated the lack of extensive student 
modeling. Tiie knowledge about the student in the second round was represented by 
the list of working memory elements that describe the student's previous moves. This 
knowledge does not provide a specific estimate of the student's aptitude. Such an 
estimate is important in order to decide which tutoring style to use, and on what level 
to provide discussions of probability concepts. Another kind of student modeling is 
required to specify the student's strengths and weaknesses. This infoimation is 
important in order for the system to make plausible decisions about the trade-off 
between coaching and teaching. That is. the decisions of when and what to discuss in- 
depui. 

Some problems were observed in the mentor and in the tutor working as separate 
systems. Despite the commands data fxndformuia, students interacting with the mentor 
sometimes lost context. An assumption was made in the mentor that backward 
reasoning is simple straight forward reasoning. For example, suppose a student has 
written the formula p(B / A) = p(A n B) : p(A), and suppose also that he has found 
p(A n B). At this point a help request is understood as saying "What should I do 
next?". The mentor would respond to the help request by saying: 'Think about 
finding p(A)". In the experiments, when some students encountered this scenario they 
were in doubt about why the mentor was telling them this. That is, they did not 
understand the backward reasoning underlying the mentor's recommendation. These 
results implied that in the mentor the system's approach should be more explicitly 
articulated. 

Some sLudcnts interacting with the tutor, although feeling comfortable with the 
tutoring style, were sometimes inhibited from skipping simple intermediate moves. 
For these students a switch from the tutor to the mentor would not be appropriate 
because they generally preferred the tutor to the mentor. Thus a mechanism that 
would allow jumping ahead in the tutor also seems desirable. 
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5J. SPIRIT: Reaching the current state 

The problems discussed above motivated the incorporation of extensive student 
modeling, specialists that discuss in detail important probability concepts, and the 
integration of the tutor and the mentor yielding the current state of SPI RIT. 

Student modeling 

The purpose of student modeling is twofold: first to construct and mainiain a belief 
about the student's aptitude, and second to represent knowledge about the student's 
weak and strong points. The first purpose is accomplished by general modeling 
capability and the second by sub-skills modeling. 

The Student Model is implemented using an object oriented programing language 
called iiousi; that was developed in the Decision Sciences Laboratory of the University 
Of Pittsburgh by Casey Quayle.This language uses some of the ideas of Smalltalk [16] 
and of I-LAVORS [17], 

The object oriented paradigm seems appropriate for the implementation of the 
Student Model because: 

- The Student Model has a dual task. It represents data about the student, and it 
performs analysis on this data making assumplions about ihc student's ability. Tims, it 
seems appropriate to combine diese two tasks in objects. 

- The Student Model includes some entities corresponding to sub-skills the student is 
supposed to acquire, and that use uniform protocols. These entities are represented in 
objects, which respond to a uniform set of messages, thereby facilitating the 
implementation: This is, all the sub-skill objects respond to the same message format 
and perform similar processing. 

- The use of objects suits the evolving characteristic of the system. For example, new 
sub-skills can easily be added, and a change in the representation of the data can be 
done only once, and not as many times as the number of sub-skills. 

♦The object oriented paradigm faciliuited the implementation of the modular 
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architecture in SPIRIT. The Student Model is interfaced to the other system*s 
components by messages. Thus, the other components do not have to assume any 
particular data structures in the Student Model, and any change in the data structures 
does not necessarily impose changes in the other components. 

General Modelirig 

Each move that the student takes reveals some knowledge about the student's 
aptitude. At any point in the dialogue, the system's belief about the student's aptitude 
is based on the cumulative evidence derived from the moves that the student has 
taken so far. An additional move taken by the student changes this information and 
therefore can change the belief that was held before. Thus, system's belief must be 
revised after each move taken by the student. There are two measures associated with 
each possible move. One is a measure of difficulty and the second is a measure of 
correctness. The first measure specifies the degree of difficulty in making the move ■■ 
correctly. The second specifies the degree to which the student is close to the correct 
move. A move whose two measures arc high supports the belief that the student's 
aptitude is high. The system's belief is constructed based on these two measures of all 
the moves taken so far. The,systcm*s belief about the student's aptitude is used for 
several purposes: it is used for selecting the proper tutoring style: the specialists use it 
for uiiloring the level of discussions to the student's ability: and it is used in order to 
provide the student with an evaluation at the end of the tutoring session. 



Sub-skills modeling 

The system needs to know how the student has done with respect to each sub-skili 
involved in problem solving. The sub-skills modeling provides a mechanism by which 
the system can easily examine the student's previous moves with respect to each sub- 
skill. The sub-skills modeling component does not make decisions. It merely 
represents information in a way that facilitates the analysis done by the specialists. For 
a detailed description of the sub-skills modeling process see [18]. 

Specialists 



26 



Th^ purpose of the specialists is to discuss in detail basic probability concepts. A 
discussion evoked by a specialist includes two parts, an abstract discussion of the 
concept and an analysis of the student's previous moves pertinent to the current 
discussion. The specialists serve the purpose of teaching the basics, which sometimes 
contradicts, as I mendoned earlier, the instructional purpose of pursuing automatism. 
Therefore, a specialist is called only when a student reveals that he lacks the basic 
knowledge in the corresponding concept The system reaches this conclusion when 
several hints and explanations in various levels have failed in helping the student In 
this case, a specialist is called and assumes control. 

A specialist can be called more than once, but it does not present the abstract 
discussion a second time, it only reminds the student about the previous discussion, 
and analyzes his previous moves. The abstract discussion is tailored to tlie student's 
aptitude. A student who has high aptitude receives a more challenging presentation of 
the cf oept that often includes abstract questions. Tlie discussion may also include 
topics which are not presented to the student who has low aptitude. For example, the 
marginal probability specialist tolls the high aptitude student that the two arguments 
of the marginal formula are mutually.cxclusive and guides him by asking him first to 
write the formula in set notation. The student who has low aptitude receives a more 
straightforward explanation aimed directly at the final formula. In the second part of 
the discussion, the specialist examines the student's model to see if he has taken other 
moves that are related to the current issue under discussion. 

Tutor-mentor transitions 

The framework of two separate systems was not satisfactory in the second round. The 
approach adopted in the final system was that the system should be able to change its 
tutoring style as appropriate throughout the dialogue. At certain points in the dialogue 
either the system or the student may wish to change the current tutoring style. The 
student does not know the internal system structure and he is not aware of the two 
distinct tutoring styles and of the frequent transitions between them. However, the 
student may initiate a transition from tutor to mentor by telling the system: "I do not 
want to answer your question so leave me alone and let me solve the problem my 
way.". The student can say this by responding to a specific question asked by the 
system by typing the command "alone". Although the student does not know about 
the concepts of "tutor" and "mentor" he would never type the command "alone" 
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while interacting with the mentor because only the tutor asks specific questions. Thus» 
the student sees one system that changes its tutoring style from time to time, taking 
into account his preferences. 

The system initiates transitions when it seems appropriate based on the student's 
aptitude and his recent moves. In the tutor, there are three points, called transition 
points, where the system considers whether or not to switch to the mentor. The first 
point is in the beginning of the dialogue, the second is when the student completes the 
symbolic stage, nnd the third point is after the final solution is found and before the 
next problem is presented. At these points, the system initiates a transition from the 
tutor to the mentor if the student's aptitude is sufficiently high. Much more flexibility 
in intitiating transitions is available in the mentor, which transfers control to the tutor 
whenever it seems appropriate. The tutor handles the misuike that caused the 
transition and then returns control back to the mentor. Thus, most of the transitions 
from the mentor are temporary. The mentor is also capable of transferring control to 
the tutor for handling the rest of the dialogue (or until ^the tutor would initiate its own 
transition) at certain points when the student's aptitude is Judged relatively low. 

Alleviating some of the other problems 

In the second round the mentor improperly assumed that backward reasoning is a 
straightforward approach and that the student can easily use it. The mentor in the 
fin.:! system makes backward reasoning explicit rather than implicit. 

Another problem in the second round was that the tutor did not allow skipping of 
moves. In the final system the tutor allows this by enabling ihe specific help request, 
which was used previously only in the mentor. When a student wishes to skip some 
moves, he types the probability expression that he wants to explore followed by a "?". 
The tutor then makes this expression the current focus of the discussion, and proceeds 
as though it had reached this expression through the usual backward chaining process. 

6. Discussion 

SPIRIT has been used by undergraduate students taking an introductory course in 
probability theory. Their reaction to the system is very encouraging, and we intend to 
report results of experimcnus in the near future. However, the system has some 
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drawbacks and another round might improve the system even more. 

SPIRITS shortcomings seem to be rooted in the inadequacy of its tutoring model 
One problem is the feeling expressed by both subjects and observers of the 
experiments that the system, does not focus on meaningful learning [19]. Meaningful 
learning occu«-s when the material that is learned is related to some general structure 
or principle. Often, after meaningful learning, individuals are better able to transfer 
their knowledge to new kinds of problems. In contrast, subjects who interacted with 
SPIRIT, did not show the desired ability to transfer knowledge they acquire while 
solving one kind of problem to solving another kind of problem. In the experiments, 
students solved two typical Bayesian problems followed by a problem that required 
the application of the addition rule. They were able to transfer knowledge they had 
acquired in solving the fii-st problem to the second one because both problems have a 
similar structure. However, they failed in the third problem, being unable to apply 
effectively the strategy of backward reasoning they used before. 

By comparing protocols generated by SPIRIT to the human tutor's protocols, it 
appears that the system does not emphasize enough the general principles and 
concepts of probability theory, but rather focuses on the task of getting the final 
answer to the current probability olem. The human tutor pursued two goals which ' 
sometimes conflict with each other. ne (Irst goal is making the student understand 
the concepts and the foundations of probability theory. That is, the tutor would like 
the student to be able to derive and to build an abstract representation of the real 
woHd in terms of probability concepts as is represented by the English sentences in the 
probability problem. The second goal is to teach the student how to solve similar 
probability problems. Wriilc the first goal is expressed by the cognitive outcome that 
we want to achieve, the second is expressed in terms of acquiring an applicable skill. It 
seems that it is possible to achieve one goal without achieving the other. For example, 
the student may be able to solve probability problems by playing in a systematic way 
with the formulae, but without having tlie representation of the worid in terms of 
probability concepts. SPIRIT focuses on the second goal and does not suffuciently 
pursue the first one. 

A second major problem with SPIRIT, which is related to what wc have discussed 
above, is the problem of losing context, and losing or forgclling pieces of knowledge 
provided by the system. That is, subjects often forget what the final question is, and 
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what the intermediate problem is, which is the focus of their attention. In an earlier 
stage of the system, one of the reasons for that problem was hypothesized to be the fact 
that SPIRIT had (incorrectly) assumed that the student knew the domain 
independent problem solving method of backward chaining. This assumption was 
embedded in several aspects of the system's behavior. Later, this assumption was 
refuted and the system, currently, explicitly teaches backward reasoning. Despite this, 
although experiments conducted al\er SPIRITS behavior has been changed showed 
some improvement in keeping context losing context still remains a major complaint 
This unexpected result implies that something more basic is wrong in SPIRIT. In 
Barzilay [18] it was hypothesized that one of the reasons for SPlRlTs shortcomings is 
its inability to tie various issues together as the human expert tutor does. Based on this 
hypothesis, Barzilay proposed a framework for enhancing the tutoring model in 
SPIRIT. This framework uses a lattice data structure of probability concepts to be used 
by the tutoring exert in making decisions about which concepts to discuss at what 
points in the dialogue, and to what other concepts they should be tied. 

SPIRITS success is paitly due to oi>ss that facilitated the evokition of the system. 
However, despite the advantages of oi>ss discussed earlier, oi>ss has some disadvantages 
that were overcome by integrating ows with other Al methods. The important 
disadvantages were: 

- oi*s5 is weak in list processing. 

- No processing is allowed in a production's LHS, so one logic rule often needs to be 
implemented in several interrelated productions making the explicit knowledge in the 
logic rule segmented and less explicit Also, debugging becomes tricky because of the 
interrelated productions. 

■ Productions can not be organized hierarchically. The "context" mechanism suits a 
linear organization (e.g. as in Rl, [20]), but docs not suit ihc hierarchical structure in 
SPIRIT very well. A tangled hierarchy is even more difficult to implement using this 
mechanism. 

- Complex data structures are not supported. The only knowledge structure which can 
be directly accessed by 0PS5 is the list of working memory elements. 
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7. Conclusions 

We presented a brief description of SPIRIT by taking three snapshots of the system's 
behavior along the evolution process. Our philosophy is that an effective system 
evolves through a process of experimenting and tuning. A system needs a long tuning 
process in order to intelligently make tutoring decisions, such as deciding when things 
are really going wrong and that an intervention is required, or when it is better to let 
the student struggle by himself Although, presently, the decisions that SPIRIT makes 
are not always the ones that the well experienced tutor would, wc believe we have 
made quite a lot of progress since we started to experiment with SPIRIT. For 
example, the idea of achieving a flexible dialogue style by employing the luior and the 
wc/i/or came to our mind after experiments with SPIRIT in its early stages. Despite the 
fact that this idea meant a significant change in SPI RITs behavior, the 
implementation was completed in a very short time. 

The methods of production system and object oriented programming proved to be 
very helpful in the evolutionary design of SPIRIT. All together SPIRIT is 
implemented by three programming paradigms: procedure oriented (i.isi»), rule 
oriented (ops5) and object oriented (iiousi:). llius, the research in SPI RIT demonstrates 
the need for Al programming environments that support several paradigms. We hope 
that our work will help AI designers in the difficult task of choosing the right tools for 
the right tasks. 

SPIRIT is one of the very few ITS that actually do a satisfactory tutoring job. Students 
in the University of Pittsburgh used the system as an assistant in the introductory 
probability course. Most of them expressed real enthusiasm and after passing the 
course attributed some of their success to the system. 

SPI RIT covers only a small subset of probability theory. However, in terms of 
complexity and size, the system is quite large. It employs about 120 oi«s5 productions 
and some 100 lisp subroutines. It has been developed over a period of 18 months and 
it runs.(with a reasonable response time) on a vax 11/780. 
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